| Home AMM Home Technology Services Products Partners Gallery Licensing Contact |
![]() |
||||||||||||||||||||||||||||||||||||||||||||||||||
MPEG Audio Layer-3
History
|
|||||||||||||||||||||||||||||||||||||||||||||||||||
| 1:4 | by Layer 1 (corresponds with 384 kbps for a stereo signal), |
|---|---|
| 1:6...1:8 | by Layer 2 (corresponds with 256..192 kbps for a stereo signal), |
| 1:10...1:12 | by Layer 3 (corresponds with 128..112 kbps for a stereo signal), |
Some typical performance data of
MPEG Layer-3 are:
| sound quality | bandwidth | mode | bitrate | reduction ratio |
|---|---|---|---|---|
| telephone sound | 2.5 kHz | mono | 8 kbps * | 96:1 |
| better than shortwave | 4.5 kHz | mono | 16 kbps | 48:1 |
| better than AM radio | 7.5 kHz | mono | 32 kbps | 24:1 |
| similar to FM radio | 11 kHz | stereo | 56...64 kbps | 26...24:1 |
| near-CD | 15 kHz | stereo | 96 kbps | 16:1 |
| CD | >15 kHz | stereo | 112..128kbps | 14..12:1 |
| *) Fraunhofer uses a non-ISO extension of MPEG Layer-3 for enhanced performance ("MPEG 2.5") | ||||
In all international listening tests,
MPEG Layer-3 impressively proved its superior performance, maintaining the
original sound quality at a data reduction of 1:12 (around 64 kbit/s per
audio channel). If applications may tolerate a limited bandwidth of around
10 kHz, a reasonable sound quality for stereo signals can be achieved even
at a reduction of 1:24.
For the use of low bit-rate audio
coding schemes in broadcast applications at bitrates of 60 kbit/s per
audio channel, the ITU-R recommends MPEG Layer-3. (ITU-R doc.
BS.1115)
Filter
bank
The filter bank used in MPEG Layer-3 is a hybrid
filter bank which consists of a polyphase filter bank and a Modified
Discrete Cosine Transform (MDCT). This hybrid form was chosen for reasons
of compatibility to its predecessors, Layer-1 und
Layer-2.
Perceptual Model
The perceptual
model is mainly determining the quality of a given encoder implementation.
It uses either a seperate filter bank or combines the calculation of
energy values (for the masking calculations) and the main filter bank. The
output of the perceptual model consists of values for the masking
threshold or the allowed noise for each coder partition. If the
quantization noise can be kept below the masking threshold, then the
compression results should be indistinguishable from the original
signal.
Joint Stereo
Joint stereo coding
takes advatage of the fact that both channels of a stereo channel pair
contain far the same information. These stereophonic irrelevancies and
redundancies are exploited to reduce the total bitrate. Joint stereo is
used in cases where only low bitrates are available but stereo signals are
desired.
Quantization and Coding
A system
of two nested iteration loops is the common solution for quantization and
coding in a Layer-3 encoder.
Quantization is done via a
power-law quantizer. In this way, larger values are automatically coded
with less accuracy and some noise shaping is already built into the
quantization process.
The quantized values are coded by
Huffman coding. As a specific method for entropy coding, hufman coding is
lossless. Thus is called noiseless coding because no noise is added to the
audio signal.
The process to find the optimum gain and
scalefactors for a given block, bit-rate and output from the perceptual
model is usually done by two nested iteration loops in an
analysis-by-synthesis way: